SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Somerville, Chris 

Broun, Pierre 

(ii) TITLE OF INVENTION: Production of Hydroxylated Fatty Acids in 

Genetically Modified Plants 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 



(A) 


ADDRESSEE: 


CUSHMAN DARBY 


& CUSHMAN, LLP 


(B) 


STREET : 


1100 NEW YORK 


AVENUE, N.W. 


(C) 


CITY: 


WASHINGTON 




(D) 


STATE : 


D.C. 




(E) 


COUNTRY : 


USA 




(F) 


ZIP: 


20005-3918 





3 .. - 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.44 KB storage 

(B) COMPUTER: IBM PC 

(C) OPERATING SYSTEM: DOS 5.0 

m (D) SOFTWARE: Word Perfect 5.1 

7 (vi) CURRENT APPLICATION DATA; 

□ (A) APPLICATION NUMBER: unassigned 

m (B) FILING DATE: 

;;j (C) CLASSIFICATION: 

!~ (vii) ATTORNEY /AGENT INFORMATION: 

;~J (A) NAME: Paul N. Kokulis 

! ^ (B) REGISTRATION NUMBER: 16,773 

(C) REFERENCE/ DOCKET NUMBER: 1220/213781 

(viii) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202 861 3000 

(B) TELEFAX: 202 822 0944 

(2) INFORMATION FOR SEQ ID NO:l 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 543 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

TATTGGCACC GGCGGCACCA TTCCAACAAT GGATCCCTAG AAAAAGATGA AGTCTTTGTC 60 
CCACCTAAGA AAGCTGCAGT CANATGGTAT GTCAAATACC TCAACAACCC TCTTGGACGC 12 0 
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ATTCTGGTGT TAACAGTTCA GTTTATCCTC GGGTGGCCTT TGTATCTAGC CTTTAATGTA 180 

TCAGGTAGAC CTTATGATGG TTTCGCTTCA CATTTCTTCC CTCATGCACC TATCTTTAAG 240 

GACCGTGAAC GTCTCCAGAT ATACATCTCA GATGCTGGTA TTCTAGCTGT CTGTTATGGT 3 00 

CTTTACCGTT ACGCTGCTTC ACAAGGATTG ACTGCTATGA TCTGCGTCTA CGGAGTACCG 360 

CTTTTGATAG TGAACTTTTT CCTTGTCTTG GTCACTTTCT TGCAGCACAC TCATCCTTCA 420 

TTACCTCACT ATGATTCAAC CGAGTGGGAA TGGATTAGAG GAGCTTTGGT TACGGTAGAC 480 

AGAGACTATG GAATCTTGAA CAAGGTGTTT CACAACATAA CAGACACCCA CGTAGCACAC 540 

CAC 54 3 



42) INFORMATION FOR SEQ ID NO: 2 
| (i) SEQUENCE CHARACTERISTICS: 

£ (A) LENGTH: 544 nucleotides 

= (B) TYPE: nucleotide 

2 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

; ry (ii) MOLECULE TYPE: cDNA 

id (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

IJ ; 

n 1 

Itataggcacc GGAGGCACCA TTCCAACACA GGATCCCTCG AAAGAGATGA AGTATTTGTC 60 

^CCAAAGCAGA AATCCGCAAT CAAGTGGTAC GGCGAATACC TCAACAACCC TCCTGGTCGC 120 

ATCATGATGT TAACTGTCCA GTTCGTCCTC GGATGGCCCT TGTACTTAGC CTTCAACGTT 180 

TCTGGCAGAC CCTACAATGG TTTCGCTTCC CATTTCTTCC CCAATGCTCC TATCTACAAC 2 40 

GACCGTGAAC GCCTCCAGAT TTACATCTCT GATGCTGGTA TTCTAGCCGT CTGTTATGGT 3 00 

CTTTACCGTT ACGCTGTTGC ACAAGGACTA GCCTCAATGA TCTGTCTAAA CGGAGTTCCG 3 60 

CTTCTGATAG TTAACTTTTT CCTCGTCTTG ATCACTTACT TACAACACAC TCACCCTGCG 420 

TTGCCTCACT ATGATTCATC AGAGTGGGAT TGGCTTAGAG GAGCTTTAGC TACTGTAGAC 4 80 

AGAGACTATG GAATCTTGAA CAAGGTGTTC CATAACATCA CAGACACCCA CGTCGCACAC 54 0 

CACT 544 



(2) INFORMATION FOR SEQ ID NO : 3 
(i) SEQUENCE CHARACTERISTICS: 
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4 



% > 



(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: genomic 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGAAGCTTT ATAAGAAGTT AGTTTTCTCT GGTGACAGAG AAATTNTGTC AATTGGTAGT 60 

GACAGTTGAA GCAACAGGAA CAACAAGGAT GGTTGGTGNT GATGCTGATG TGGTGATGTG 120 

TTATTCATCA AATACTAAAT ACTACATTAC TTGTTGCTGC CTACTTCTCC TATTTCCTCC 180 

GCCACCCATT TTGGACCCAC GANCCTTCCA TTTAAACCCT CTCTCGTGCT ATTCACCAGA 240 

jsJgagaagcca AGAGAGAGAG AGAGAGAATG TTCTGAGGAT CATTGTCTTC TTCATCGTTA 3 00 

j|TAACGTAAG TTTTTTTTGA CCACTCATAT CTAAAATCTA GTACATGCAA TAGATTAATG 3 60 

[JjCTGTTCCTT CTTTTGATAT TTTCAGCTTC TTGAATTCAA GATGGGTGCT GGTGGAAGAA 420 

j§AATGGTTAC CCCCTCTTCC AAGAAATCAG AAACTGAAGC CCTAAAACGT GGACCATGTG 480 

JAGAAACCACC ATTCACTGTT AAAGATCTGA AGAAAGCAAT CCCACAGCAT TGTTTCAAGC 540 

jlCTCTATCCC TCGTTCTTTC TCCTACCTTC TCACAGATAT CACTTTAGTT TCTTGCTTCT 600 

IACTACGTTGC CACAAATTAC TTCTCTCTTC TTCCTCAGCC TCTCTCTACT TACCTAGCTT 660 

fsGGCCTCTCTA TTGGGTATGT CAAGGCTGTG TCTTAACCGG TATCTGGGTC ATTGGCCATG 720 

AATGTGGTCA CCATGCATTC AGTGACTATC AATGGGTAGA TGACACTGTT GGTTTTATCT 780 

TCCATTCCTT CCTTCTCGTC CCTTACTTCT CCTGGAAATA CAGTCATCGT CGTCACCATT 840 

CCAACAATGG ATCTCTCGAG AAAGATGAAG TCTTTGTCCC ACCGAAGAAA GCTGCAGTCA 900 

AATGGTATGT TAAATACCTC AACAACCCTC TTGGACGCAT TCTGGTGTTA ACAGTTCAGT 960 

TTATCCTCGG GTGGCCTTTG TATCTAGCCT TTAATGTATC AGGTAGACCT TATGATGGTT 1020 

TCGCTTCACA TTTCTTCCCT CATGCACCTA TCTTTAAAGA CCGAGAACGC CTCCAGATAT 1080 

ACATCTCAGA TGCTGGTATT CTAGCTGTCT GTTATGGTCT TTACCGTTAC GCTGCTTCAC 1140 

AAGGATTGAC TGCTATGATC TGCGTCTATG GAGTACCGCT TTTGATAGTG AACTTTTTCC 1200 

TTGTCTTGGT AACTTTCTTG CAGCACACTC ATCCTTCGTT ACCTCATTAT GATTCAACCG 12 60 

AGTGGGAATG GATTAGAGGA GCTTTGGTTA CGGTAGACAG AGACTATGGA ATATTGAACA 132 0 

AGGTGTTCCA TAACATAACA GACACACATG TGGCTCATCA TCTCTTTGCA ACTATACCGC 1380 
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ATTATAACGC 


AATGGAAGCT 


ACAGAGGCGA 


TAAAGCCAAT 


ACTTGGTGAT 


TACTACCACT 


1440 


TCGATGGAAC 


ACCGTGGTAT 


GTGGCCATGT 


ATAGGGAAGC 


AAAGGAGTGT 


CTCTATGTAG 


1500 
\j \j 


AACCGGATAC 


GGAACGTGGG 


AAGAAAGGTG 


TCTACTATTA 


CAACAATAAG 


TTATGAGGCT 


1560 


GATAGGGCGA 


GAGAAGTGCA 


ATTATCAATC 


TTCATTTCCA 


TGTTTTAGGT 


GTCTTGTTTA 


1620 

^ v<» 


AGAAGCTATG 


CTTTGTTTCA 


ATAATCTCAG 


AGTCCATNTA 


GTTGTGTTCT 


GGTGCATTTT 


1 con 


GCCTAGTTAT 


GTGGTGTCGG 


AAGTTAGTGT 


TCAAACTGCT 


TCCTGCTGTG 


CTGCCCAGTG 


1740 


AAGAACAAGT 


TTACGTGTTT 


AAAATACTCG 


GAACGAATTG 


ACCACAANAT 


ATCCAAAACC 


1800 


GGCTATCCGA 


ATTCCATATC 


CGAAAACCGG 


ATATCCAAAT 


TTCCAGAGTA 


CTTAG 


1855 



4;2) INFORMATION FOR SEQ ID NO: 4 

(i) SEQUENCE CHARACTERISTICS: 

!2 < A > LENGTH: 384 amino acids 

yi (B) TYPE: amino acid 

^ (C) STRANDEDNESS: 

]» (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{31 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



10 

Met Gly Ala Gly Gly Arg He Met Val Thr Pro Ser Ser Lys Lys Ser 

20 30 
Glu Thr Glu Ala Leu Lys Arg Gly Pro Cys Glu Lys Pro Pro Phe Thr 

40 

Val Lys Asp Leu Lys Lys Ala He Pro Gin His Cys Phe Lys Arg Ser 

50 60 
He Pro Arg Ser Phe Ser Tyr Leu Leu Thr Asp He Thr Leu Val Ser 

70 80 
Cys Phe Tyr Tyr Val Ala Thr Asn Tyr Phe Ser Leu Leu Pro Glh Pro 

90 

Leu Ser Thr Tyr Leu Ala Trp Pro Leu Tyr Trp Val Cys Gin Gly Cys 

100 110 
Val Leu Thr Gly He Trp Val He Gly His Glu Cys Gly His His Ala 

120 

Phe Ser Asp Tyr Gin Trp Val Asp Asp Thr Val Gly Phe He Phe His 
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130 140 
Ser Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg 

150 160 
His His Ser Asn Asn Gly Ser Leu Glu Lys Asp Glu Val Phe Val Pro 

170 

Pro Lys Lys Ala Ala Val Lys Trp Tyr Val Lys Tyr Leu Asn Asn Pro 

180 190 
Leu Gly Arg lie Leu Val Leu Thr Val Gin Phe lie Leu Gly Trp Pro 

200 

Leu Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Phe Ala 

210 220 
Ser His Phe Phe Pro His Ala Pro lie Phe Lys Asp Arg Glu Arg Leu 

230 240 
iGln lie Tyr lie Ser Asp Ala Gly lie Leu Ala Val Cys Tyr Gly Leu 

250 

Tyr Arg Tyr Ala Ala Ser Gin Gly Leu Thr Ala Met lie Cys Val Tyr 

260 270 
Gly Val Pro Leu Leu lie Val Asn Phe Phe Leu Val Leu Val Thr Phe 

^ 280 
mi Leu Gin His Thr His Pro Ser Leu Pro His Tyr Asp Ser Thr Glu Trp 

\1 290 300 

i-jGlu Trp lie Arg Gly Ala Leu Val Thr Val Asp Arg Asp Tyr Gly lie 

310 320 
Leu Asn Lys Val Phe His Asn lie Thr Asp Thr His Val Ala His His 

330 

Leu Phe Ala Thr lie Pro His Tyr Asn Ala Met Glu Ala Thr Glu Ala 

340 350 
lie Lys Pro lie Leu Gly Asp Tyr Tyr His Phe Asp Gly Thr Pro Trp 

360 

Tyr Val Ala Met Tyr Arg Glu Ala Lys Glu Cys Leu Tyr Val Glu Pro 

370 380 
Asp Thr Glu Arg Gly Lys Lys Gly Val Tyr Tyr Tyr Asn Asn Lys Leu 



(2) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

10 

Met Gly Gly Gly Gly Arg Met Ser Thr Val lie Thr Ser Asn Asn Ser 

20 30 
Glu Lys Lys Gly Gly Ser Ser His Leu Lys Arg Ala Pro His Thr Lys 

40 

Pro Pro Phe Thr Leu Gly Asp Leu Lys Arg Ala lie Pro Pro His Cys 

50 60 
(□Phe Glu Arg Ser Phe Val Arg Ser Phe Ser Tyr Val Ala Tyr Asp Val 

m 70 80 

IB Cys Leu Ser Phe Leu Phe Tyr Ser lie Ala Thr Asn Phe Phe Pro Tyr 



90 

tjlle Ser Ser Pro Leu Ser Tyr Val Ala Trp Leu Val Tyr Trp Leu Phe 



100 110 

□Gin Gly Cys lie Leu Thr Gly Leu Trp Val lie Gly His Glu Cys Gly 

IB? " 

His His Ala Phe Ser Glu Tyr Gin Leu Ala Asp Asp lie Val Gly Leu 



130 140 
lie Val His Ser Ala Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser 

150 160 
His Arg Arg His His Ser Asn lie Gly Ser Leu Glu Arg Asp Glu Val 

170 

Phe Val Pro Lys Ser Lys Ser Lys lie Ser Trp Tyr Ser Lys Tyr Ser 

180 190 
Asn Asn Pro Pro Gly Arg Val Leu Thr Leu Ala Ala Thr Leu Leu Leu 

200 

Gly Trp Pro Leu Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp 

210 220 
Arg Phe Ala Cys His Tyr Asp Pro Tyr Gly Pro lie Phe Ser Glu Arg 

230 240 
Glu Arg Leu Gin lie Tyr lie Ala Asp Leu Gly lie Phe Ala Thr Thr 

250 

Phe Val Leu Tyr Gin Ala Thr Met Ala Lys Gly Leu Ala Trp Val Met 



260 270 
Arg lie Tyr Gly Val Pro Leu Leu lie Val Asn Cys Phe Leu Val Met 

280 

lie Thr Tyr Leu Gin His Thr His Pro Ala lie Pro Arg Tyr Gly Ser 

290 300 
Ser Glu Trp Asp Trp Leu Arg Gly Ala Met Val Thr Val Asp Arg Asp 

310 320 
Tyr Gly Val Leu Asn Lys Val Phe His Asn lie Ala Asp Thr His Val 

330 

Ala His His Leu Phe Ala Thr Val Pro His Tyr His Ala Met Glu Ala 

340 350 
Thr Lys Ala lie Lys Pro lie Met Gly Glu Tyr Tyr Arg Tyr Asp Gly 

Jn 360 

m Thr Pro Phe Tyr Lys Ala Leu Trp Arg Glu Ala Lys Glu Cys Leu Phe 

^Val Glu Pro Asp Glu Gly Ala Pro Thr Gin Gly Val Phe Trp Tyr Arg 

™ Asn Lys Tyr 



_ ) INFORMATION FOR SEQ ID NO: 6 
" (i) SEQUENCE CHARACTERISTICS: 

-~ (A) LENGTH: 38 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

10 

Met Gly Ala Gly Gly Arg Met Pro Val Pro Thr Ser Ser Lys Lys Ser 

20 30 
Glu Thr Asp Thr Thr Lys Arg Val Pro Cys Glu Lys Pro Pro Phe Ser 

40 

Val Gly Asp Leu Lys Lys Ala lie Pro Pro His Cys Phe Lys Arg Ser 

50 60 
lie Pro Arg Ser Phe Ser Tyr Leu lie Ser Asp lie lie lie Ala Ser 
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Cys Phe Tyr Tyr Val A Ia Thr Asn Tyr Phe Ser Leu Leu Pro Gin Pro 

Leu Ser Tyr Leu Ala Trp Pro Leu Tyr Tr£ Ala Cys Gin Gly Cys Val 

100 

Leu Thr Gly He Trp Val He Ala His Glu Cys Gly His Hil Ala Phe 
Ser Asp Tyr Gin Trp Leu Asp Asp Thr Val Gly Leu n e Phe His Ser 

130 

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His 

150 

Hxs ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys 
Gin Lys Ser Ala He Lys Trp Tyr Gly £ys Tyr Leu Asn Asn Pro Leu 
| GlY Arg Ile ™* Met Thr Val Gin Phe Val Leu Gly Trp Pro Leu 



200 



w Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Phe Ala Cys 

O 210 

| His Phe Phe Pro Asn Ala Pro He Tyr Asn Asp Arg Glu Arg Leu Gin 



230 



He Tyr Leu ser Asp Ala Gly He Leu Ala Val Cys Phe Gly Leu Tyr 



250 



Arg Tyr Ala Ala Ala Gin Gly Met Ala Ser Met He Cys Leu Tyr Gly 
Val Pro Leu Leu" He Val Asn Ala Phe Leu Val Leu lie ?hr Tyr Leu 



280 



Gin His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp Asp 



290 

300 



Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly He Leu 



310 



Asn Lys Val Phe His Asn lie Thr Asp Thr His Val Ala His His Leu 



s Ala Ile 



Phe Ser Thr Met Pro His Tyr Asn Ala Met Glu Ala Thr Ly 
Lys Pro lie Leu Gly Asp Tyr Tyr Gin Phe Asp Gly Thr Pro Trp Tyr 
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360 

Val Ala Met Tyr Arg Glu Ala Lys Glu Cys lie Tyr Val Glu Pro Asp 



370 380 
Arg Glu Gly Asp Lys Lys Gly Val Tyr Trp Tyr Asn Asn Lys Leu 



2) INFORMATION FOR SEQ ID NO: 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
3(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

3 10 

IMet Gly Ala Gly Gly Arg Met Gin Val Ser Pro Pro Ser Lys Lys Ser 

3 20 30 

J Glu Thr Asp Asn lie Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr 

3 40 

i Val Gly Glu Leu Lys Lys Ala lie Pro Pro His Cys Phe Lys Arg Ser 

50 60 
, lie Pro Arg Ser Phe Ser His Leu lie Trp Asp lie lie lie Ala Ser 

70 80 
Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro Asn Pro 

90 

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gin Gly Cys Val 

100 110 
Leu Thr Gly Val Trp Val lie Ala His Glu Cys Gly His Ala Ala Phe 

120 

Ser Asp Tyr Gin Trp Leu Asp Asp Thr Val Gly Leu lie Phe His Ser 

130 140 
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His 

150 160 
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Arg 

170 

Arg Ser Gin Thr Ser Ser Gly Thr Ala Ser Thr Ser Thr Thr Phe Gly 



4. i 



Arg Thr Val iSet° Leu Thr Val Gin Phe Thr Leu Gly Trp Pro Leu ^ 
Leu Ala Phe Asn V al ser Gly f°° pro ^ ^ ^ 

210 

His Phe His Pro Asn Ala r>*-~ -ri ™ 220 

Pro Asn Ala Pro He Tyr Asn Asp ^ Glu ^ ^ 

230 

He Tyr He Ser Asp Ala Gly n e Leu Ala v al ^, m 240 

j-y xie Leu Ala Val Cys Tyr Gly Leu Leu 

-» Tyr Ala Ala val G1 „ Gly val 2|» ^ ^ ^ ^ ^ 

Val Pro Leu "u ne Val Asn 01y Phe ^ Val ^ ^ "0 ^ ^ 
pn His Thr His Pro Ser Leu Pro His T yr Asp Ser Ser Glu Trp Asp 

0 290 



glTrp Leu Arg Gly Ala Leu Ala Thr- v al a~ » 300 

^ Ala Thr Val As P Arg Asp Tyr Gly n e Leu 

f~ " ° ly ^ HiS AS " Ile Thr *=P Thr His Glu Ala His His Leu 
, >Be S " Thr MSt Pro Hi = ^ M- Ala "e? Glu Ala Thr Lys Ala He 

<4 340 

J.ys Pro ne Leu Gly Glu Tyr Tyr Gin Ph e Asp Gly Thr III Val Val 
Lys Ala Ket Trp Arg Giu Ala Lys Glu Cys lie Tyr Val Glu Pro Asp 
Arg lli Gly Glu Lys Lys Gly Val Phe Trp Tyr 11° Asn Lys Leu Xaa 



(2) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS- 

(B) TYPE™ 5 " 3 ° 9 amino 

(c) strJndedness: amin ° acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



- 74 




4 * /► 



ser Leu Leu Thr Ser Phe Ser Tyr Val v" Tyr Asp Leu Ser 



Phe Ala 



20 



Phe He Phe Tyr He Ala Thr Thr Tyr Phe 

40 



30 



His Leu Leu Pro Gin Pro 



Phe ser Leu He Ala Trp Pro Ile Tyr Trp Val ^ ^ ^ ^ ^ 

I- ^ Ar, v al Cys Gly ,u His Ala Phe Ser Ly° Tyr Cln Trp Val 
Asp Asp val val Gly Leu Thr „ Hls Ser Thr ^ ^ 

Phe ser Trp Lys He Ser His Arg Arg H?s His Ser As„ Thr Gly ser 



80 

Pro Tyr 



100 



Leu Asp Ar, Asp Glu Arg Val Lys Val Ala Trp P„e Ser Lys° Tyr Leu 



120 



£ Asn Asn Pro Leu Gly Arg Ala Val Ser 



130 



Leu Leu Val Thr Leu Thr II, 



Oly Trp Pro Met Tyr Leu Ala Phe Asn Val Ser g1° Arg Pro Tyr Asp 



150 



3 5 



Ser Phe Ala Ser His Tyr His Pro Tvr ™ -, » 160 

yj. nis fro Tyr Arg Val Arg Leu Leu lie Tyr 

Val Ser Asp Val Ala Leu Phe Ser Val Thr" Tvr- ^ t m 

vax xnr Tyr Ser Leu Tyr Arg Val 



180 



Ala Thr Leu Lys Gly Leu Val Trp Leu Leu 

200 



190 



Cys Val Tyr Gly val Pro 



210 



His Tyr Asp ser Ser Glu Trp Asp Trp Leu Lys §ty Ala Leu Ala Thr 
Met Asp Arg Asp Tyr Gly He Leu Asn Lys Val Phe His His He In? 



Asp Thr His Val Ala His His Leu P„e Ser Thr Met Pro His Tyr Hi 



260 



Leu Arg Val Lys Pro xle Leu Gly Glu Tyr Tyr Gin Phe Asp° Asp Thr 



280 



Pro Phe Tyr Lys Ala Leu Trp Arg Glu Al 



a Arg Glu Cys Leu Tyr Val 
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290 300 
Glu Pro Asp Glu Gly Thr Ser Glu Lys Gly Val Tyr Trp Tyr Arg Asn 



Lys Tyr Leu Arg Val 



(2) INFORMATION FOR SEQ ID NO: 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 302 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
=^(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



10 

^Phe Ser Tyr Val Val Tyr Asp Leu Thr lie Ala Phe Cys Leu Tyr Tyr 

Z 20 30 

rfVal Ala Thr His Tyr Phe His Leu Leu Pro Gly Pro Leu Ser Phe Arg 

40 

r[Gly Met Ala lie Tyr Trp Ala Val Gin Gly Cys lie Leu Thr Gly Val 
y 50 60 

ifTrp Val Val Ala Phe Ser Asp Tyr Gin Leu Leu Asp Asp lie Val Gly 

^ 70 80 

Leu lie Leu His Ser Ala Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr 

90 

Ser His Arg Arg His His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu 

100 110 
Val Phe Val Pro Lys Val Ser Lys Tyr Leu Asn Asn Pro Pro Gly Arg 

120 

Val Leu Thr Leu Ala Val Thr Leu Thr Leu Gly Trp Pro Leu Tyr Leu 

130 140 
Ala Leu Asn Val Ser Gly Arg Pro Tyr Asp Arg Phe Ala Cys His Tyr 

150 160 
Asp Pro Tyr Gly Pro lie Tyr Ser Val lie Ser Asp Ala Gly Val Leu 

170 

Ala Val Val Tyr Gly Leu Phe Arg Leu Ala Met Ala Lys Gly Leu Ala 

180 190 
Trp Val Val Cys Val Tyr Gly Val Pro Leu Leu Val Val Asn Gly Phe 



200 

Leu Val Leu lie Thr Phe Leu Gin His Thr His Val Ser Glu Trp Asp 

210 220 
Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly lie Leu 

230 240 
Asn Lys Val Phe His Asn lie Thr Asp Thr His Val Ala His His Leu 

250 

Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Val Glu Tyr 

260 270 
Tyr Arg Phe Asp Glu Thr Pro Phe Val Lys Ala Met Trp Arg Glu Ala 

280 

Arg Glu Cys lie Tyr Val Glu Pro Asp Gin Ser Thr Glu Ser Lys Gly 

290 300 
Val Phe Trp Tyr Asn Asn Lys Leu Ala Met Glu Ala Thr Val 



INFORMATION FOR SEQ ID NO: 10 

Jg ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 372 amino acids 

;U (B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

10 

Met Gly Ala Gly Gly Arg Met Thr Glu Lys Glu Arg Glu Lys Gin Glu 

20 30 
Gin Leu Ala Arg Ala Thr Gly Gly Ala Ala Met Gin Arg Ser Pro Val 

40 

Glu Lys Pro Pro Phe Thr Leu Gly Gin lie Lys Lys Ala lie Pro Pro 

50 60 
His Cys Phe Glu Arg Ser Val Leu Lys Ser Phe Ser Tyr Val Val His 

70 80 
Asp Leu Val lie Ala Ala Ala Leu Leu Tyr Phe Ala Leu Ala lie lie 

90 

Pro Ala Leu Pro Ser Pro Leu Arg Tyr Ala Ala Trp Pro Leu Tyr Trp 

100 110 
lie Ala Gin Gly Ala Phe Ser Asp Tyr Ser Leu Leu Asp Asp Val Val 



Gly Leu Val Leu Hie o 5 20 

Leu His Ser Ser Leu Met Val Pro Ty r Phe Ser Trp hys 

Tyr ser His Arg Arg His His « ™_ 140 

g His His ser Asn Thr Gly ser Leu Glu Arg Asp 

«» val Phe val Pro £ys° Lys Lys Glu Ala Leu Pro Xrp Tyr Thr £J 
Tyr val Tyr Asn Asn Pro Val Gly Ar, ill Val Hls Ile w ^ ^ 
Leu Thr Leu 5£ Trp Pro Leu Tyr Leu Ala Thr Asn Ala ll°r G ly ^ 
Pro Tyr Pro Arg Phe Ala Cys hL° Phe Asp Pro Tyr Gly Pro Xle Tyr 

3 210 * 

QAsn Asp Arg Glu Arg Ala Gin He Phe Val S er- 11° „ 

3 e val Ser Asp Ala Gly Val Val 



230 



in Ala Val Ala Phe Gly Leu Tyr Lys Leu A1 s 240 
u ^ys Leu Ala Ala Ala Phe Gly Val Trp 

1 Trp val val Ar, val Tyr Ala Val Pro Hi L eu xle val Asn Ala Trp 
S -u val Leu III Thr Tyr Leu Gin His Thr His Pro Ser £ Pro His 

2 ^ ~ G1U TrP AS » ^ *~ -g Gly Ala Leu Ala Thr Met 



290 



Asp Arg Asp Tyr Gly xle Leu Asn Arg Val Phe „i°s° Asn Xle Thr Asp 



310 



Thr His val Ala His His Leu Phe Ser Thr Met Pro His Tyr His III 
«et Glu Ala Thr Lys Ala Xle Arg Pro 11° L eu Gly Asp Tyr Tyr His 
*he Asp Pro Til" Pro Vai Ala Lys Ala Thr Trp Arg Glu Ala Gly oln 
=ys 10. Tyr val Glu Pro olu A |° ^ Lys ^ ^ ^ ^ ^ ^ 

Lys Lys Phe Xaa 



(2) INFORMATION FOR SEQ ID NO: 11 
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(X) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: "„-,„ 

(B) TYPE: . an, mo acids 

(C) STRANDEDNESS • ai ^no acid 

(D) TOPOLOGY: ' linear 

(ii) MOLECULE TYPE: ^ . 

protein 

(*D SEQUENCE DESCRIPTION : SEQ ID NO: 11: 

r-u Leu Asp a2 p ° Val Val Gly Leu Ile Leu His ^ ^ ^0 ^ ^ 

Z ^ T - * - - ^ His His Ser Asn Thr 

^ ^ A - ^ G - ~ «- Va! Pro Lys Lys Lys Ser Ser 

I 1 " ^ ^ T ^ ^ ^ T - - *~ - Pro Pro Gly Arg Ile M ao 

Thr lie Ala Val Thr Leu 90 
^ t-iy Trp Pro Leu Tyr Leu Ala Phe 

100 



-n v al ser 01y ^ Pro Tyr Asp ^ ^ ^ ^ ^ £ ^ ^ 

^ - y Pro Ile Tyr Asn ftsp - ^ ^ ^ ^ ^ 

ASP A1 a „ y vax Leu Ala val Thr phe wy ^ JJJ ^ e 

Lys Gly Leu Ala % val ^ ^ G ^ v ^ ^ ^ ^ 
v al Asn ^ Phe ^ val Leu ^ g, ^ ws ^ ^ 

P« Ala L eu £ „ ls Tyr Asp s „ ser ^ ^ ^ ^ 

Ala Leu Ala Thr Val Asn a™ 2 °° m 

Asp Arg Asp Tyr Gly n e Leu Asn Lys Val Phe 

His Asn He Thr Asp Thr Gin Val Ala «• 220 

n vai Ala His His Leu Phe Thr Met Pro 
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(2) 



INFORMATION FOR SEQ ID NO:12 



(B) TYPE: 20 nu cleotides 

(C) STRANDEDNESS : ' * 

(D) TOPOLOGY; 



(ii) MOLECULE TYPE: 
(Xi) SEQUENCE DESCRIPTION: 
GCTCTTTTGT GCGCTCATTC 



nucleotide 
single 

linear 



DNA 

SEQ ID NO:l2: 



(2) 



INFORMATION FOR SEQ 1D 



Hfi) SEQUENCE 



NO: 13 



(A) LENG^ 0 ^ 15 "^: 

(B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 



II ii) MOLECULE TYPE: 
|xi) SEQUENCE DESCRIPTION: 
CGjiTACCAGA AAACGCCTTG 



20 nucleotides 
nucleotide 
single 

linear 



DNA 

SEQ ID NO: 13: 



INFORMATION FOR SEQ I D NO: 14 



(B) type: 20 nuc leotides 

(C) STRANDEDNESS: ™<=1<— -~ 

(D) TOPOLOGY: 



(ii) MOLECULE TYPE: 
Cxi) SEQUENCE DESCRIPTION: 
TAYWSNCAYM GNMGNCAYCA 



nucleotide 
single 

linear 



DNA 

SEQ ID NO: 14; 



(2) INFORMATION FOR SEQ I D NO: 15 

(B) TYPE: 21 nuc leotides 

(C) STRANDEDNESS: Jin^?^^ 6 

(D) TOPOLOGY: single 



(ii) MOLECULE TYPE: 
(xi) SEQUENCE DESCRIPTION 
RTGRTGNGCN ACRTGNGTRT C 




1ST - 



